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2.1.3 Unordered Sampling without Replacement: 
Combinations 

Here we have a set with n elements, e.g., A = {1, 2, 3,....«} and we want to draw k samples 
from the set such that ordering does not matter and repetition is not allowed. Thus, we basically 
want to choose a ^-element subset of A, which we also call a ^-combination of the set A. For 
example if A = {1,2, 3} and k = 2, there are 3 different possibilities: 

1 . { 1 , 2 }; 

2. {1,3}; 

3. {2,3}. 

We show the number of ^-element subsets of A by 

0 

This is read "n choose k." A typical scenario here is that we have a group of n people, and we 

n 

would like to choose k of them to serve on a committee. A simple way to find ( , ) is to compare 

fl 

it with Pj,. Note that the difference between the two is ordering. In fact, for any //-element subset 
of A = {1,2, 3,.... n }, we can order the elements in k\ ways, thus we can write 



Therefore, 

Q n ! 

k\(n ~ k )!' 


Note that if k is an integer larger than n, then ( ) = 0. This makes sense, since if k > n there is 
no way to choose k distinct elements from an //-element set. 
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( t 


) 


is also called the binomial coefficient. This is because the coefficients in the binomial 


n 

theorem are given by ( ^ ). In particular, the binomial theorem states that for an integer n > 0, we 
have 


n 

(a + b) n = Y ( n \a k b n ~ k . 

*= 0 W 

Note: There are several different common notations that are used to show the number of k- 
combinations of an //-element set including C n k , C(n, k), C " k , nCk, etc. In this book, we always 

n 

us e(*). 


Example 2.6 

I choose 3 cards from the standard deck of cards. What is the probability that these cards contain 
at least one ace? 

• Solution 

o Again the phrase "at least" suggests that it might be easier to first find P{A C ), the 
probability that there is no ace. Here the sample space contains all possible ways to 
choose 3 cards from 52 cards, thus 


5 



There are 52 - 4 = 48 non-ace cards, so we have 


Thus 


A c 



} 


P(A) 




Example 2.7 

How many distinct sequences can we make using 3 letter "A"s and 5 letter "B"s? (AAABBBBB, 
AABABBBB, etc.) 
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• Solution 

o You can think of this problem in the following way. You have 3+5 = 8 positions to 
fill with letters A or B. From these 8 positions, you need to choose 3 of them for As. 
Whatever is left will be filled with Bs. Thus the total number of ways is 

8 
3 

Now, you could have equivalently chosen the locations for Bs, so the answer would 
have been 


Thus, we conclude that 



The same argument can be repeated for general n and k to conclude 

You can check this identity directly algebraically, but the way we showed it here is interesting in 
the sense that you do not need any algebra. This is sometimes a very effective way of proving 
some identities of binomial coefficients. This is proof by combinatorial interpretation. The basic 
idea is that you count the same thing twice, each time using a different method and then 
conclude that the resulting formulas must be equal. Let us look at some other examples. 


Example 2.8 

Show the following identities for non-negative integers k and m and n, using combinatorial 
interpretation arguments. 

1. We have !*=<>(*) = 2". 

2 - For 0 < k < n, we have (” + j ) = (J l ) + (” )• 

m + n , m n 

3. We have ( ) = X/ = o ( • )(,_ . ) (Vandermonde's identity). 

• Solution 
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o 1. To show this identity, we count the total number of subsets of an //-element 

set A. We have already seen that this is equal to 2" in Example 2.3 . Another 
way to count the number of subsets is to first count the subsets with 0 
elements, and then add the number of subsets with 1 element, and then add 
the number of subsets with 2 elements, etc. But we know that the number of k 

n 

-element subsets of A is ( k ), thus we have 
2 n = Number of subsets of A 

= Xv^oNumber of //-element subsets of A (2.1) 

= I*=o(I). 

We can also prove this identity algebraically, using the binomial theorem, 

(a + b) n = Y!k=o{ k ) ak b n ~ k - If we let a = b = 1, we obtain 2" = X^=o(^) - 

2. To show this identity, let's assume that we have an arbitrary set A with n + 1 
distinct elements: 


A {ci j, & 2 i ^ 3 , * * * 5 +1 ^ * 

We would like to choose a k+ 1-element subset B. We know that we can do 

n + 1 

this in ( , + ! ) ways (the right hand side of the identity). Another way to count 

the number of k + 1-element subsets B is to divide them into two non¬ 
overlapping categories based on whether or not they contain a n+x . In 

particular, if a n+l £ B, then we need to choose k + 1 elements from 

n 

{flj, a 2 , a 3 ,..., a n } which we can do in ( , + 1 ) different ways. If, on the 
other hand, a n + x e B, then we need to choose another k elements from 

n 

{aj, a 2 , a 3 ,..., a n } to complete B and we can do this in ( ^ ) different ways. 
Thus, we have shown that the total number of k + 1-element subsets of an 

n n 

n + 1-element set is equal to ( ) + ( ). 

3. Here we assume that we have a set A that has m + n elements: 

A= {a x , a 2 , a 3 ,..., a m , b x ,b 2 ,..., b n }. 

m + n 

We would like to count the number of A-elemcnt subsets of A. This is ( ). 

Another way to do this is first choose i elements from {a x , a 2 , a 3 ,, a m } 
and then k - i elements from {b x , b 2 ,. .., b n }. This can be done in 
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rn n 

( • ) ( k _ . ) number of ways. But i can be any number from 0 to k, so we 

m + n , m n 

conclude ( t ) = , )(*_,)■ 


Let us now provide another interpretation of ( , ). Suppose that we have a group of n people and 

we would like to divide them two groups A and B such that group A consists of k people and 
group B consists of n - k people. To do this, we just simply need to choose k people and put 
them in group A, and whoever is left will be in group B. Thus, the total number of ways to do 

n 

this is (*). 


The total number of ways to divide n distinct objects into two groups A and B such that 

n 

group A consists of k objects and group B consists of n - k objects is ( , ). 


Note: For the special case when n = 2k and we do not particularly care about group names A and 

1 n 

B, the number of ways to do this division is y ( ) to avoid double counting. For example, if 22 

players want to play a soccer game and we need to divide them into two groups of 11 players, 

1 22 

there will be y ( n ) ways to do this. The reason for this is that, if we label the players 1 to 22, 
then the two choices 


A = {1,2,3,..., 11} and B= {12, 13, 14,..., 22}, 

A = {12, 13, 14,..., 22} and B = {1, 2, 3,..., 11} 
are essentially the same. 

For example, we can solve Example 2.1 in the following way: We have 8 blank positions to be 
fdled with letters "A" or "B." We need to divide them into two groups A and B such that group A 
consists of three blank positions and group B consists of 5 blank spaces. The elements in group 
A show the positions of "A"s and the elements in group B show the positions of "B"s. Therefore 

8 

the total number of possibilities is ( ). 

Bernoulli Trials and Binomial Distribution: 

Now, we are ready to discuss an important class of random experiments that appear frequently in 
practice. First, we define Bernoulli trials and then discuss the binomial distribution. A Bernoulli 
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Trial is a random experiment that has two possible outcomes which we can label as "success" 
and "failure," such as 

• You toss a coin. The possible outcomes are "heads" and "tails." You can define "heads" as 
success and "tails" as "failure" here. 

• You take a pass-fail test. The possible outcomes are "pass" and "fail." 

We usually denote the probability of success by p and probability of failure by q = 1 - p. If we 
have an experiment in which we perform n independent Bernoulli trials and count the total 
number of successes, we call it a binomial experiment. For example, you may toss a coin n times 
repeatedly and be interested in the total number of heads. 


Example 2.9 

Suppose that I have a coin for which P{H) = p and P( T) = 1 - p. I toss the coin 5 times. 

a. What is the probability that the outcome is THHHHl 

b. What is the probability that the outcome is HTHHHl 

c. What is the probability that the outcome is HHTHH1 

d. What is the probability that I will observe exactly four heads and one tails? 

e. What is the probability that I will observe exactly three heads and two tails? 

f. If I toss the coin n times, what is the probability that I observe exactly k heads and n - k 
tails? 


• Solution 


o a. To find the probability of the event A = {THHHH}, we note that A is the 

intersection of 5 independent events: A = first coin toss is tails, and the next 
four coin tosses result in heads. Since the individual coin tosses are 
independent, we obtain 

P(THHHH) = p(T) x p(H) x p(H) x p{H) x p(H) 

= (1 ~P)P 4 . 


b. Similarly, 

P{HTHHH) = p(H) x p(T) x p(H) x p(H) x p(H) 

- (i -p!p 4 


c. Similarly, 

P(HHTHH) = p(H) x p(H) x p(T) x p(H) x p(H) 
= (1 ~P)P 4 - 


d. Let B be the event that I observe exactly one tails and four heads. Then 
B = { THHHH , HTHHH , HHTHH , HHHTH , HHHHT}. 

Thus 

P(B) = P {THHHH) + P{HTHHH) + P{HHTHH) + P{HHHTH) + P{HHHHT) 
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(1 ~p)p 4 + (1 ~p)p 4 + (1 ~p)p 4 + (1 ~p)p 4 + (1 ~p)p 4 

5/(1 -p). 


e. Let C be the event that I observe exactly three heads and two tails. Then 


C = {TTHHH, THTHH, THHHTH ,. . ., HHHTT]. 


Thus 

P(C) = P(TTHHH) + P{THTHH) + P{THHTH) + ... + P{HHHTT) 


(1 ~ p) 2 p 2 + (1 ~ p) 2 p 2 + (1 ~ p) 2 p 2 + • • • + (1 ~ p) 2 p 2 

\c\p\l-p) 2 . 


But what is \C\1 Luckily, we know how to find | C |. This is the total number 
of distinct sequences that you can create using two tails and three heads. This 
is exactly the same as Example 2.1 . The idea is that we have 5 positions to fill 
with letters H or T. From these 5 positions, you need to choose 3 of them for 
Hs. Whatever is left is going to be filled with Ts. Thus the total number of 
5 

elements in C is ( ), and 



f. Finally, we can repeat the same argument when we toss the coin n times and 
obtain 



Note that here, instead of writing P{k heads and n - k tails), we can just write 
P(k heads). 


Binomial Formula: 

For n independent Bernoulli trials where each trial has success probability p, the probability 
of k successes is given by 


m= C 
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Multinomial Coefficients: 

n 

The interpretation of the binomial coefficient ( ^ ) as the number of ways to divide n objects into 

two groups of size k and n - k has the advantage of being generalizable to dividing objects into 
more than two groups. 


Example 2.10 

Ten people have a potluck. Five people will be selected to bring a main dish, three people will 
bring drinks, and two people will bring dessert. Flow many ways they can be divided into these 
three groups? 

• Solution 

o We can solve this problem in the following way. First, we can choose 5 people for 

10 

the main dish. This can be done in ( ) ways. From the remaining 5 people, we 

then choose 3 people for drinks, and finally the remaining 2 people will bring 
desert. Thus, by the multiplication principle, the total number of ways is given by 

/10w5 w2\ 10! 5! 2! 10! 

[ 5 / \3 / \2 / 5 !5 ! ' 3 !2 ! ' 2!0! ~ 5 !3 !2 !' 


This argument can be generalized for the case when we have n people and would like to divide 
them to r groups. The number of ways in this case is given by the multinomial coefficients. In 
particular, if n = n l + n 2 + ... + n r , where all n . > 0 are integers, then the number of ways to 

divide n distinct objects to r distinct groups of sizes n x , n 2 , ■.., n r is given by 

/ n \ n ! 

\n j, n 2 , ..., n r ) n x \n 2 \. . . n r \ ' 

We can also state the general format of the binomial theorem, which is called the multinomial 
theorem: 


(*i 


+ x 2 + 


+x,r = 


I ( n 

+ ...+„ =n \n x ,n 2 ,...,n r ) 


( 2 . 2 ) 


Finally, the binomial fonnula for Bernoulli trials can also be extended to the case where each 
trial has more than two possible outcomes. 


Example 2.11 
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I roll a die 18 times. What is the probability that each number appears exactly 3 times? 

• Solution 

o First of all, each sequence of outcomes in which each number appears 3 times has 
probability 



Flow many distinct sequences are there with three l's, three 2's, and three 6's? 
Each sequence has 18 positions which we need to fdl with the digits. To obtain a 
sequence, we need to choose three positions for l's, three positions for 2's, ..., and 
three positions for 6's. The number of ways to do this is given by the multinomial 
coefficient 


( 18 1 

\3, 3, 3, 3, 3, 3 / 


18! 

3 !3 !3 !3 !3 !3 ! ’ 


Thus the total probability is 


18! /I \18 
(3 !) 6 \ 6 j 


We now state the general form of the multinomial formula. Suppose that an experiment has r 
possible outcomes, so the sample space is given by 

S = {s h s 2 , • • • 

Also suppose that P(s t ) = p t for i = 1, 2,..., r. Then for n = n l + n 2 + ... + n r independent 
trials of this experiment, the probability that each s { appears n- times is given by 


/ n \ 

\n x ,n 2 ,... ,n r / 1 ' 


p 2 ...p r 


n \ \n 2 \. 


n,A 


P\P2 


n 

■Pr 


<— previous 
next —> 
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